(54) Camera on-board voice recognition

(57) In a camera, apparatus and methods for recording or
storing voiced messages in message memory associated with
an exposed image frame for reproduction as an annotation
with prints made from the exposed image frames. The camera
includes a voice recognition system for voice recognition of
words spoken by the user before the words are stored in
message memory. Fixed and adjustable vocabularies are
provided for use in the voice recognition. The adjustable
vocabulary may be loaded into the camera by a vocabulary
memory card or through an interface with a docking station
of a personal computer or vending machine so that an
adjustable vocabulary of words of interest to the camera
user or related to a scene or event of photographic interest
may be employed in the voice recognition. The messages may
be recorded on magnetic film layers or stored in camera
on-board memory or in detachable message memory modules to
be forwarded with the exposed film strip to a photofinisher
for read out and printing on the prints. The messages may be
stored or recorded by the user in real time with each
exposure or at an earlier or later time.
Description

CROSS-REFERENCE TO RELATED APPLICATION 

Cross-reference is hereby made to commonly assigned U.S.
Patent Application Serial No. (Docket 68.052A) filed on even
date herewith to VOICE RECOGNITION OF RECORDED MESSAGES FOR
PHOTOGRAPHIC PRINTERS in the names of Dale F. McIntyre,
Bryan D. Bernardi and Dana W. Wolcott.

FIELD OF THE INVENTION 

This invention relates to the field of photographic film
cameras and film processing, and particularly to apparatus
for providing voice information storage and retrieval
capabilities employing speech recognition.

BACKGROUND OF THE INVENTION 

A variety of advanced still camera systems have been
recently disclosed in which data related to the scenes
photographed is input into memory associated with the film
strip to be forwarded to the photofinisher to aid in making
prints, and, in some instances, including messages or
annotations to be associated with the prints. As set forth
in commonly assigned U.S. Patent No. 5,276,472 (incorporated
herein by reference in its entirety), systems for providing
voice recording in cameras in association with the exposure
of each film image frame have become known in the art. In
the '472 patent, a system is provided for recording a
message in transparent magnetic film tracks in a
magnetics-on-film (MOF) layer on the non-emulsion side of
the film overlying the image frame area. The user may speak
words into a camera microphone/speaker that are processed
into digital signals stored temporarily in memory. When the
message is completed, the user may command the camera to
play it back audibly for review and editing as considered
necessary. When the message content is considered
satisfactory, the digitized annotation may be recorded in
the MOF layer tracks of the exposed image frame during
advancement of the film to the next image frame. The user
may continue recording messages for each image frame
exposed, and the fully exposed film is processed so that the
resulting prints carry the annotations corresponding to the
messages recorded.

The '472 patent is directed to such a system where
the coding of each message on the associated print is 
done during the photofinishing operation in such a way 
that the machine readable coded information allows the 
audible reproduction of the message through the use of 
a special hand-held scanner. The coded information is 
in the form of a bar code, a blister spot pattern or the like 
that may be scanned and translated into an audio voice 
reproduction by the hand-held scanner moved over the 
coded information. A brief alphanumeric place and date 
annotation may also be printed in the border of the print. 



The system disclosed in the '472 patent thus simply
phonetically processes the speech that is recorded into
coded information, and the hand-held scanner phonetically
reproduces the words. A brief, alphanumeric place and date
annotation may also be printed in the border of the print by
the user or the photofinisher from listening to and manually
transcribing the recorded message. The quality and accuracy
of reproduction of the message depends on how carefully and
clearly the words are spoken by the user during the editing
and re-recording operation.

In order to store such information, as well as the image
frame identification to which it pertains, and other
information automatically entered from the camera operating
system or optionally entered by the camera user, it is
necessary to employ such a further writable and readable
media in association with either the film (as disclosed in
the '472 patent) or in some other storage media or memory.

With respect to recording camera operating conditions and
time and date for each image frame exposure other than on
the film itself, it has been proposed to magnetically read
and write data on magnetic strips formed on the sides or an
end of the film cartridge, as described, for example, in
U.S. Patent No. 4,443,077. More recently, it has been
suggested that such data may be stored and retrieved from
non-volatile memory chips, e.g. an EEPROM, incorporated in
an integrated circuit chip "card" as set forth in U.S.
Patent No. 5,128,700. The card may be separable from the
camera and film cartridge, or a similar EEPROM card may be
attached to the film cartridge as set forth generally in
U.S. Patent No. 5,070,355. Alternatively, the storage of
such information in "ROM-ICs" attached permanently or
releasably to the sides or ends of film cartridges is
disclosed in U.S. Patent No. 5,142,310.

The '700 patent also stores sound or voice messages related
to the image frames in the removable sound cards that are
intended to accompany the film when it is sent in for
processing, so that the message may be reproduced as a sound
code with the print made from the negative film image frame
for phonetic playback of the message. Alternatively, the
card itself is read out phonetically. As in the '472 patent,
the sound is reproduced phonetically as the coded
information or recorded data is scanned.

In another embodiment disclosed in the '700 patent and in a
further U.S. Patent No. 4,344,682, a camera is described for
recording information related to each image frame as small
alphanumeric characters exposed in a corner portion of the
image frame for photographic reproduction on the print made
from the negative image frame. The information may be
recorded or stored in temporary memory prior to making the
exposures of the image frames and exposed on the image
frames in conjunction with the image frame exposure. As each
message is inputted into memory, it is displayed and may be
edited. The input mechanism depicted is a keyboard, but it
is suggested that other input means, including a voice
recognition device, may be substituted for the keyboard.

In an electronic still camera disclosed in U.S. Patent No.
4,742,369, it is also suggested that a keyboard or a voice
recognition circuit be employed to input spoken information
to be stored regarding each image that is captured by the
camera.

In a further camera system, e.g. that disclosed in U.S.
Patent No. 5,027,149, voiced commands are given by the user
to command each camera operation. A voice recognition
circuit is employed in a system for training the camera to
recognize and respond accurately to the spoken commands.

Problems to be Solved by the Invention

It is difficult to accurately reproduce spoken words stored
in analog or digital form in a non-photographic media
associated with film strip image frames as alphanumeric,
readable annotations on the prints made from the image
frames. It is desirable to make the reproduction without
human inspection of the annotation before it is printed in
the border of the print. It is not possible to rely simply
on the editing process and careful pronunciation of the
words to ensure that they are processed accurately before
they are stored for later read out and printing as
annotation on the prints made from the exposures.

SUMMARY OF THE INVENTION

It is therefore an object of the present invention to
provide a method and apparatus for ensuring the accurate
processing of spoken words that can be accurately reproduced
by photofinishing equipment.

These and other objects of the invention are realized in a
photographic camera including an optical lens, a
photographic filmstrip transport mechanism for advancing the
filmstrip in a path of travel to and through an image frame
exposure gate with respect to said optical lens, and an
exposure system for making an exposure of the filmstrip
image frame in the exposure gate, apparatus for recording a
voice message related to the exposure made or to be made for
playback in conjunction with making prints from the
photographic images captured in the image frames of the
filmstrip to provide for the printing of the voice message
therewith comprising: speech input means into which a camera
user may speak words of the message to be stored with
respect to the filmstrip image frames; sound processing
means for processing the words spoken into the speech input
means as voice digital data; means for providing reference
voice digital data corresponding to a reference word
vocabulary; speech recognition means for comparing the
processed voice digital data to the reference voice digital
data and recognizing processed voice digital data
corresponding to the reference vocabulary voice digital
data; message memory means having memory locations related
to each image frame of the filmstrip for storing recognized
voice digital data; and means for storing the recognized
voice digital data in said message memory means.

A variety of vocabulary sources may be employed to 
load in a fixed vocabulary and adjustable vocabulary of 
voice digital data corresponding to commonly used 
words and words specific to an event or attraction of pho- 
tographic interest. The vocabulary sources may be 
detachable vocabulary memory cards insertable into the 
camera for connection with the camera system or may 
comprise an interface for down loading vocabulary words 
from a camera docking station. 

Similarly, the message memory means may com- 
prise memory media associated with the film strip and 
detachable with the film cartridge for transfer to the pho- 
tofinisher or may remain in the camera if the camera is 
a single use, recyclable camera returned with the film 
cartridge to the photofinisher for processing. 

The user may employ methods of recording or stor- 
ing the messages in real time with the exposure of each 
image frame employing voice recognition or at a later 
time. 

Advantages of the Invention

The invention advantageously results in the storage 
of accurate word messages in relation to film strip image 
frames that may be automatically read out and accu- 
rately printed as readable annotations on the associated 
print without requiring the photofinisher to interpret and 
correct the message before it is printed. 

BRIEF DESCRIPTION OF THE DRAWINGS 

These and other objects, advantages and features of the
invention will become apparent from the detailed description
given hereinafter in relation to the accompanying drawings,
in which:

Figure 1 is a diagram illustrating partial dedicated tracks
in a virtually transparent MOF layer and a cross section of
the layers of film particularly adapted for use in a camera
of a first embodiment of the invention;

Figure 2 is a schematic illustration of a camera having
speech recording apparatus in accordance with the various
embodiments of the invention;

Figure 3 is a schematic block diagram of a system for
recording speech in a camera in accordance with the first
embodiment of the invention;

Figure 4 is a schematic block diagram of a system for
recording speech in a camera in accordance with a first
variation on the first embodiment of the invention;

Figure 5 is a schematic block diagram of a system for
recording speech in a camera in accordance with the second
embodiment of the invention;

Figure 6 is a schematic block diagram of a system for
recording speech in a camera in accordance with a first
variation on the second embodiment of the invention;

Figure 7 is a flow chart of the camera based voice
recognition steps taken to accurately store and record each
word of a message voiced by the user;

Figure 8 is a flow chart of the steps of the transfer to
print operation undertaken by the photofinisher;

Figure 9 is a flow chart of the combined steps of recording
a message in real time relation to the exposure of each
image frame and the finishing of the prints with the message
appearing as an annotation on the print;

Figure 10 is a flow chart of the combined steps of recording
a message using terms common to a series of exposures and
the finishing of the prints with the common terms appearing
as an annotation on each print; and

Figure 11 is a flow chart of the combined steps of recording
a voice message temporarily for each image frame and editing
and re-recording the edited messages at a later time and the
finishing of the prints with the message appearing as an
annotation on the print.

DETAILED DESCRIPTION OF THE PREFERRED 
EMBODIMENTS OF THE INVENTION 

In accordance with an aspect of the invention, the camera
operating system includes speech recognition of spoken words
which are compared to an on-camera word vocabulary stored in
fixed vocabulary ROM and adjustable vocabulary RAM or EEPROM
or the like as described hereafter. Spoken words are
processed and compared to the vocabulary. The acceptance of
the word is indicated by displaying it to the user on the
camera LCD display or by audibly playing back the closest
matching word. Rejection of the word may also be indicated
to the user. The user may either speak the word or an
alternate word and repeat the process until the word is
matched and accepted. In this aspect of the invention, the
recognized words form a message that is stored in memory
associated with the film image frame so that the memory
accompanies the film to the photofinisher where each message
can be read out and printed along with the respective print
made from the film image frame. The speech recognition
operation results in a more automated photofinishing
operation not requiring constant operator monitoring and
translation of the messages into more readable text.
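
The accept/reject behavior described above can be illustrated
with a minimal sketch. It is not the recognition method of the
invention: it assumes, purely for illustration, that each spoken
word has already been reduced to a short numeric feature vector
and that matching is a nearest-template search with a distance
threshold. The names recognize, record_message and max_distance
are hypothetical.

```python
import math


def recognize(features, vocabulary, max_distance=1.0):
    """Return the closest vocabulary word, or None when no stored
    template is within max_distance (the word is rejected)."""
    best_word, best_distance = None, math.inf
    for word, template in vocabulary.items():
        distance = math.dist(features, template)
        if distance < best_distance:
            best_word, best_distance = word, distance
    return best_word if best_distance <= max_distance else None


def record_message(utterances, vocabulary, max_distance=1.0):
    """Build a message from accepted words and count rejections.

    In a camera, a rejection would be signalled to the user, who
    would then repeat the word or speak an alternate word.
    """
    message, rejected = [], 0
    for features in utterances:
        word = recognize(features, vocabulary, max_distance)
        if word is None:
            rejected += 1
        else:
            message.append(word)
    return message, rejected
```

Only accepted words reach the message, which is what allows the
later read out and printing to proceed without an operator
correcting the text.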

Due to the space and power supply limitations inherent in
miniaturized still cameras, it is not possible at this time
to provide a large scale vocabulary covering all words of a
given language in the camera or to provide the processing
speed sufficient to effect the comparison of the voice
digital data to the memorized word data in a reasonable time
period. In a further refinement of the invention, it is
proposed that the vocabulary of words likely to be spoken by
the user to describe the scene being photographed include a
fixed vocabulary and an adjustable vocabulary word set. The
speech recognition operating system is constructed to
accommodate the fixed vocabulary of common words stored in
ROM likely to be used in most situations, e.g. the months of
the year. The system accommodates an adjustable vocabulary
or vocabularies of other words selected by the user and
stored in RAM or EEPROM in advance of using the camera and
which may be related to specific persons, events or
attractions. The manner of storing an adjustable vocabulary
may include a variety of means and sources, including words
keyed in on a camera mounted keyboard, words stored in a
personal computer and downloaded through a connector
interface, or words stored in plug-in RAM or EEPROM cards
inserted into special slots of the camera.
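
The split between a fixed and an adjustable word set can be
sketched as follows. This is an illustrative model only: the
class name CameraVocabulary, its methods, and the choice to let
fixed words take precedence over adjustable words of the same
spelling are assumptions, not details taken from the disclosure.

```python
from types import MappingProxyType


class CameraVocabulary:
    """Fixed words behave like ROM; adjustable words like RAM/EEPROM."""

    def __init__(self, fixed_words):
        # Read-only view: stands in for a vocabulary loaded at manufacture.
        self.fixed = MappingProxyType(dict(fixed_words))
        self.adjustable = {}

    def load_adjustable(self, words):
        """Replace the adjustable set, e.g. from a keyboard, a computer
        interface or a plug-in card (the source is not modelled here)."""
        self.adjustable = dict(words)

    def words(self):
        """Return every word available for recognition.

        Assumption: on a name collision the fixed entry wins.
        """
        merged = dict(self.adjustable)
        merged.update(self.fixed)
        return merged
```

Loading a new adjustable set for a different event leaves the
fixed words untouched, so only the event-specific part of the
vocabulary has to be exchanged.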

Voice recognition systems that are available commercially
function in a variety of ways to process speech or voiced
words to derive a match to a pre-stored word in a vocabulary
memory unit. Voice recognition systems may be used with
vocabulary memory units which involve either "speaker
independent training" or "speaker dependent training".
Speaker independent training means that each word in a given
vocabulary has enough training patterns stored in memory in
association with it to distinguish that word from any other
word in the vocabulary regardless of the pronunciation of
the word by a general population of potential users. The
training patterns for the various pronunciations of such
words are determined in advance and stored for the words of
the vocabulary memory unit before it is supplied to the user
in the camera or otherwise as described below. Such words
should be recognized by the system when spoken by a fairly
broad spectrum of the population and do not require the user
to undertake a personalized training regimen of the
vocabulary words prior to voicing the words. However,
storing a sufficient number of training patterns for each
word to ensure reliable recognition is expensive and does
take up memory space.

Speaker dependent training means that the words of the
vocabulary are trained to be recognized when spoken by a
specific person, that is, the trainer and user of the
vocabulary. In a training session initiated by the camera
and the user, the words in the vocabulary may be displayed
one at a time, and the user voices the words into the camera
microphone. A training pattern is generated and stored as
the user voices each word in the course of completing the
training session. For example, if a fixed vocabulary is
subjected to speaker dependent training, the speaker is
prompted by a displayed or aural command to voice each word
of the vocabulary one or more times in sequence. Each spoken
word is then processed and the processed signal is stored as
the pattern to be recognized for that word in the future.
While the process may be time consuming, the memory space
required for the unique training pattern associated with the
word is reduced. And, of course, the speech recognition
system for the speaker dependent vocabulary words may well
be unusable by other persons.
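
The memory trade-off between the two training approaches can be
sketched as below. The sketch assumes, for illustration only,
that a training pattern is a small numeric vector, that repeated
utterances are averaged into one pattern, and that a capture
callback stands in for the microphone and sound processor. All
function names are hypothetical.

```python
import math


def train_speaker_dependent(words, capture, repetitions=2):
    """Prompt for each word and store ONE averaged pattern per word.

    capture(word) is a stand-in for recording and processing the
    user's utterance of the prompted word.
    """
    patterns = {}
    for word in words:
        samples = [capture(word) for _ in range(repetitions)]
        averaged = tuple(sum(axis) / len(axis) for axis in zip(*samples))
        patterns[word] = [averaged]
    return patterns


def recognize_any(features, patterns, max_distance=1.0):
    """Match against every stored pattern of every word."""
    best_word, best_distance = None, math.inf
    for word, templates in patterns.items():
        for template in templates:
            distance = math.dist(features, template)
            if distance < best_distance:
                best_word, best_distance = word, distance
    return best_word if best_distance <= max_distance else None


def pattern_count(patterns):
    """Number of stored patterns, a rough proxy for memory use."""
    return sum(len(templates) for templates in patterns.values())
```

A speaker independent vocabulary would hold several patterns per
word in the same structure, so pattern_count grows with the
number of pronunciations covered, while the speaker dependent
set stays at one pattern per word.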
Commencing with the various embodiments of the invention,
Figures 1 and 2 depict, in conjunction with the camera
circuits of Figures 3 and 4, a first embodiment of the
invention in which the recognized voice digital data may be
stored in the MOF layer tracks, and the fixed and adjustable
vocabulary may be stored in the camera in a variety of ways.
In this embodiment, the recognized words are stored in
certain of the MOF layer tracks and are read out at the
photofinisher to be printed along with the prints made from
the film image frames.

The camera circuits of Figures 3 and 4 differ in the type of
vocabulary memories employed, as described in detail below.
Figures 5 and 6 depict a further embodiment employing camera
circuits similar to Figures 3 and 4 but for recording the
voice recognized messages in separate memory modules, rather
than on the MOF layer of Figure 1, as described below. In
all of the Figures 1 - 6, the source of or means for
providing the vocabularies may take a variety of forms as
listed above. Figures 7 - 11 illustrate possible voice
recognition and storage methods employing the apparatus of
Figures 1 - 6 for in-camera voice recognition prior to
permanently recording the messages.

Referring first to Figure 1, a strip 10 of magnetically
coated color negative film, 35 millimeters wide, useful in
the present invention includes a base 11, various well-known
photo-chemical layers 12 on one side of the base 11 and a
virtually transparent MOF layer 13 on the other side. An
anti-static and lubricating layer 14 overlies the magnetic
layer 13. The film strip 10 includes perforations 15 along
the film edge at regular intervals matching the pitch of a
metering pawl in a camera adapted to use the film strip 10.

For purposes of recording data in the MOF layer 13, each
frame of the film strip 10 may be formatted as shown in
Figure 1 and more fully described in commonly assigned U.S.
Patent No. 4,977,419, the disclosure of which is
incorporated herein by reference. More specifically, the
frame area is divided into a plurality of predetermined
longitudinal track locations designated in the drawing as
outermost tracks C0-C4 and innermost tracks F00-F29. As
described more fully in the '419 patent, certain of the
tracks may be reserved for recording of information in the
camera using magnetic recording means included in the
camera. In addition, other tracks may be reserved for use by
the photofinisher. Additionally, the '419 patent indicates
that certain of the tracks may be used for recording of
audio information. Apparatus for magnetically recording
information in the camera is more fully described in the
'419 patent and is not repeated here except to the extent
elements thereof are relevant to an understanding of the
present invention.

Referring to Figure 2, a camera 16 is schematically
illustrated with a variety of features and components usable
separately or in various combinations in the systems and
methods described below. In a first embodiment of the
invention, the camera 16 is specifically adapted to receive
and function with film having the MOF layer 13 of Figure 1.
Camera 16 is provided with a built-in audio transducer, e.g.
microphone 17, an internally mounted micro-chip 18, a
magnetic recording head 19 and a miniature speaker 20.
Camera 16 may also be provided with an LCD panel 30 for
displaying various camera settings and conditions of the
type well known in the art. In addition, the LCD panel 30
may display words that it has recognized to the user for
confirmation of the recognition as described below. Various
conventional user inputs 22 are also provided on the camera
16.

The camera 16 may also have an external interface 32 for
receiving and transmitting vocabulary words to memory in
micro-chip 18 or for reading out message words stored in
such memory. For example, the external interface 32 may
include an RS-232 port, so that camera memory in micro-chip
18 may be accessed through a computer based docking station
50 to load an adjustable vocabulary of words chosen by the
user or obtained at a particular attraction or event by
inserting the camera 16 into a docking station of a vending
machine or the like. In the event that the camera is
recyclable, the interface 32 may be employed by a docking
station 50 operated by the photofinisher to read out the
messages stored in camera on-board memory. In such a case,
it would not be necessary that the camera also employ the
MOF layer 13 on the film strip 10 or the cartridge related
memory module described below to record or store the message
the user wishes to appear on the prints.

Alternatively, the external interface 32 may comprise a
keyboard on the camera of the type described above in
reference to the '682 patent for keying in vocabulary words
for voice recognition as one manner of storing the
adjustable vocabulary words.

Figure 2 also schematically illustrates a message memory
module 38 that may be inserted into a slot or attached to a
film cartridge 36 in the camera 16 so that the recording of
the image frame related messages may be made in the memory
module 38 rather than in the MOF layer of the film strip 10.
Such a memory module 38 may take the form of the film
cartridge end attached modules of commonly assigned,
co-pending U.S. Patent Application S.N. 071,084 entitled
ORIENTATION INDEPENDENT DETACHABLE FILM CARTRIDGE, MEMORY
MODULE filed on June 4, 1993, in the name of J. David Cocca
and S.N. 071,096 entitled ORIENTATION INDEPENDENT DETACHABLE
FILM CARTRIDGE, MEMORY MODULE filed on June 4, 1993 in the
name of Robert S. Bryant. Alternatively, the memory module
into which the messages are recorded may be a plug-in IC
card of the type described in the above-referenced '700
patent or the cartridge mounted ROM-ICs described in the
above-referenced '310 patent or any other convenient form.

Finally, Figure 2 also shows a separate vocabulary memory
card 34 that may be inserted into a slot connector in the
camera body for loading a particular fixed vocabulary. The
fixed vocabulary memory card 34 may take the form of a
ROM-IC card of the type described in the above-referenced
'700 patent. Again, the vocabulary memory card 34 may be
vended to the user at an event or attraction of photographic
interest to customize and expand the vocabulary of fixed
words that may be recognized for a particular event or
attraction of photographic interest. Ideally, the fixed
vocabulary memory card 34 would have speaker independent
training so that the purchaser would not have to spend the
time training the words for recognition.

It will also be understood that the memory card 34 
may also constitute or include adjustable memory so that 
a user could store adjustable words in the card using the 
camera to write into and read from the memory in the 
card or using a further system for writing words into the 
memory card. In such a case, speaker dependent train- 
ing would more likely be required. 

It should be noted that the functions of the vocabulary
memory card 34 and the message memory module 38 could be
combined to operate as the memory in which recognized voice
messages are stored in the same manner as the memory module
38 or the MOF layer 13 as described above and to provide a
vocabulary for the voice recognition unit. The combination
of the message memory module 38 and vocabulary memory card
34 as both the source of the vocabulary words and the
repository of the messages recorded for each image frame as
the recognized voice digital data has certain advantages.
The recognized voice digital data can be coded to the memory
addresses for the vocabulary words, rather than repeating
the code for the word itself.
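
Coding a message as vocabulary addresses rather than as the
words themselves can be illustrated as follows. The sketch
assumes a vocabulary of at most 256 words so that each address
fits in one byte; that limit, the list-based vocabulary and the
function names are illustrative assumptions only.

```python
def encode_message(words, vocabulary):
    """Encode a recognized message as one-byte vocabulary addresses.

    Assumes every word is in the vocabulary and that the vocabulary
    holds no more than 256 entries.
    """
    address_of = {word: address for address, word in enumerate(vocabulary)}
    return bytes(address_of[word] for word in words)


def decode_message(data, vocabulary):
    """Recover the words from stored addresses, as a reader holding
    the same vocabulary memory could do."""
    return [vocabulary[address] for address in data]
```

Because the vocabulary travels in the same module as the
messages, the reader can resolve each address back to its word,
and each stored word costs a single address instead of its full
spelling.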

In all of the above described embodiments, the message
memory module 38 (to be attached to a cartridge
or formed in the film cartridge) and/or the memory card 
34 may both be purchased or provided with the purchase 
of a film cartridge from a vending machine or over the 
counter at an event or attraction of photographic interest. 
The memory card vocabulary in either case would con- 
tain words useful to recognize messages expected to be 
recorded at that attraction or event. 

Many of the means and methods for providing a 
vocabulary memory or for recording the voice recognized 
messages in memory supplied with the film cartridge to 
the photofinisher described in the preceding paragraphs 
are depicted redundantly in Figure 2. Similarly the 
embodiments of Figures 3-6 include redundant vocabu- 
lary memory sources as described below. It will be 
understood that not all of these means or methods are 
necessarily present in a single camera.

Referring to Figures 3 and 4, there is shown an expanded
block diagram of a first embodiment of the circuits and
components or system included in camera 16 for recording
voice recognized messages in MOF layer 13. The circuits of
Figures 3 and 4 are for the most part embodied in micro-chip
18, except for the memory units 42, 44 and 46.

User inputs 22 comprise buttons or switches which condition
the camera system microcontroller 23 to initiate and control
the various operating functions of the camera, including the
sound recording and playback functions of the present
invention, as well as the conventional camera auto focus and
auto exposure functions, shutter release, film advance and
the like.

Microphone 17 and speaker 20 are coupled to analog amplifier
and data processing circuit 24 to input and play back the
voiced message in one mode of operation of the circuit. A
sound processor integrated circuit (IC) 25 serves to convert
analog signals input from microphone 17 into coded digital
information suitable for storage in an on-board digital
memory 26 and for converting the stored digital information
into analog signals suitable for playback through speaker
20. Sound processor 25 may be a Texas Instruments TMS3477,
and memory 26 may be a random access memory (RAM) such as a
Hitachi HM 628128.

Another function of on-board memory 26 is to serve as a
temporary storage for the message associated with an
individual exposed image frame after voice recognition is
completed and prior to recording the message on the MOF
layer 13 of film strip 10. For this purpose, memory 26 is
coupled by the microcontroller 23 in the "write" mode to
film read/write interface circuits 27 to record the stored
audio data onto the MOF layer 13. A film advance motor
controller 28 operates at appropriate times to cause film
advance motor 29 to advance the film 10 in either the
frame-to-frame direction or in the film prewind/rewind
direction, the latter depending on the nature of the camera
involved. The messages in memory 26 are recorded in the MOF
layer tracks during such film advancement.

Turning to the vocabulary memory units 42, 44 and 46
depicted in Figure 3, the voice recognition of the spoken
words of the message is effected by the voice recognition
unit 40 under the control of the microcontroller 23. The
voice recognition unit 40 is coupled to stored vocabulary
words contained in fixed vocabulary memory units 42 or 44
and the adjustable vocabulary memory unit 46.

The memory unit 42 stores speaker independent training
patterns for the fixed vocabulary and, in Figure 3, may be
on-board ROM that is loaded at manufacture of the camera and
cannot be changed. By contrast, the fixed alternative
vocabulary memory unit 44 employs speaker dependent training
for the fixed vocabulary through combined on-board ROM and
RAM. The fixed vocabulary words are stored in ROM, and the
speaker trained word patterns for each word are stored
during the training routine in RAM to be recognized in the
later use of the camera voice recognition function.

Adjustable vocabulary memory unit 46 may be on-board RAM or
EEPROM and may employ either speaker dependent or
independent training in the contexts described below. In a
first variation, the adjustable vocabulary word memory unit
46 is provided in order to allow the user to store word sets
of his or her choice that may relate to persons or relate to
the taking of photographs at various attractions or events.
The adjustable vocabulary memory unit 46 of Figure 3 is
loaded with adjustable vocabulary words through the data
interface 32. In this case, the adjustable vocabulary words
would require speaker dependent training.

The data interface 32 may include a port for a cable from a
personal computer, so that the adjustable vocabulary words
may be inputted via the computer keyboard or memory. In
cameras equipped with a miniature keyboard, e.g. that shown
in the above-referenced '149 patent, the vocabulary words
may be keyed into the adjustable vocabulary memory unit 46
through the keyboard on the camera body and displayed to the
user on the LCD display unit 30. In a further keyboard
system for entering letters of a word of the type used in
the Magnavox® Smart Talk VCR remote control unit, a "joy
stick" is used to select letters from an alphabet appearing
on the LCD display to spell out the words of a vocabulary.
The speaker dependent training routine as described above
may follow loading of the word or words in any of these
manners. In this way, the user may input words into the
adjustable vocabulary memory unit 46 before the camera is
used for any specific event.

In a second variation, the camera may be fitted into a
docking station 50 of a vending machine or the like to make
a connection to the external interface 32 port for data
transfer via the RS-232 standard from memory in the vending
machine. For example, such vending machines may be provided
to dispense film cartridges at a theme park and to down
load adjustable word vocabularies. The camera interface 32
may be inserted into the docking station 50, and the
adjustable vocabulary memory unit 46 may be loaded when the
film cartridge is dispensed.

In this case, the down loaded adjustable vocabulary may
include speaker independent training patterns for the
adjustable vocabulary that is loaded. If the vocabulary is
speaker dependent, then the user would proceed to complete
the speaker dependent training described above after
loading of the words through the interface 32.

In a further simplification of the system depicted in
Figure 3, only a single adjustable memory unit 46 may be
provided in the camera into which the entire vocabulary
word set is loaded through the data interface 32 by any of
the above described means and methods. In this case, no
distinction would be drawn between the fixed and adjustable
vocabulary word sets, except that a portion of the RAM or
EEPROM memory locations might be loaded through a docking
station 50 and interface 32 with event or attraction
related word sets (speaker independent or dependent), and
other memory locations might be loaded by the user through
the interface 32. The memory locations may be separately
designated or tagged so that the adjustable vocabulary
entered by the user is not written over during a subsequent
down load from the docking station 50.
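
The tagging of memory locations described above can be sketched as follows. This is only an illustrative model, not the patented circuit: the class name `AdjustableVocabulary`, the tag values `USER` and `DOCK`, and the method names are assumptions introduced for the example. It shows one way a later down load could replace earlier docking-station words while leaving user-entered words untouched.

```python
from dataclasses import dataclass, field

# Hypothetical tags marking where each stored word came from.
USER = "user"
DOCK = "dock"


@dataclass
class AdjustableVocabulary:
    """Illustrative model of a single adjustable memory unit with tagged locations."""

    entries: dict = field(default_factory=dict)  # word -> source tag

    def add_user_word(self, word):
        # Words keyed in or loaded by the user are tagged USER.
        self.entries[word] = USER

    def download_from_dock(self, words):
        # Discard only earlier dock-loaded words; user-tagged words survive.
        self.entries = {w: t for w, t in self.entries.items() if t == USER}
        for word in words:
            # setdefault keeps the USER tag if the user already stored this word.
            self.entries.setdefault(word, DOCK)

    def words(self, tag=None):
        # Return all words, or only those carrying the given tag.
        return sorted(w for w, t in self.entries.items() if tag is None or t == tag)


vocab = AdjustableVocabulary()
vocab.add_user_word("Grandma")
vocab.download_from_dock(["castle", "parade"])
vocab.download_from_dock(["rollercoaster"])  # a later visit to another attraction
```

After the second down load in this sketch, the user's word is still present and only the most recent attraction word set remains alongside it.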

Turning to the embodiment of Figure 4, the system
depicted differs from Figure 3 in the types of memory
units provided. In this system, the plug-in memory units
42', 44' and 46' may each comprise part of a single
vocabulary memory IC card 34, or separate vocabulary
memory IC cards 34 may be provided for each memory
unit 42' or 44' and 46'. In other words, in one variation, a
single, insertable and replaceable memory card 34 has 
within it ROM and RAM or EEPROM with memory loca- 
tions dedicated to the storage of word patterns for the 
fixed and adjustable vocabulary words that are schemat- 
ically depicted as the plug-in memory units 42', 44' and 
46'. In the other variation, the camera may receive one 
of the plug-in fixed vocabulary memory units 42' or 44' 
or 46' as an IC card 34 that is inserted into a camera slot. 
The same camera user may be able to program separate 
plug-in adjustable vocabulary memory units 46' for dif- 
ferent situations or uses of the camera. Multiple users of 
the same camera may be able to program separate plug- 
in adjustable vocabulary memory units 46' and keep 
them for their own use of the camera. The camera may 
have more than one slot for receiving more than one IC 
card, and the slot or slots are connected by a data bus 
to the microcontroller 23. 

Figures 5 and 6 are alternative embodiments of the 
invention to Figures 3 and 4, respectively, wherein the
recording of the image frame related messages is in the 
separate memory module 38 rather than in the MOF 
layer 13 of the film strip 10. Such a memory module 38 
is schematically illustrated in Figure 2 as an alternative 
storage format to the MOF layer 13 on the film strip 10 
of Figure 1 and described above. In each case in Figure 
6, a portion or all of the memory units 42', 44' and/or 46' 
may be physically incorporated into the separate mem- 
ory module 38. For example, the adjustable memory unit 
46' of Figure 6 may be incorporated into a memory mod- 
ule that is sold at an event or attraction where film car- 
tridges are sold over the counter or vended from a 
vending machine. 

The general operation of a camera embodying any of the
above described embodiments and variations after the
vocabularies are loaded, and when it is desired to record
sound in association with taking a picture, is now
described. The camera user selects a sound recording mode
via a user input selector switch 22 that causes the camera
system microcontroller 23 to set the digital memory to the
"write" mode and then enables the analog amplifier and data
processing circuit 24 for audio recording. Assuming that
the user desires to record image-related audio, the user
talks into the camera microphone 17 to identify the scene
with appropriate information, e.g. picture taking location,
people in the scene, or other information. The user may
also verbally initiate recording of information originating
in or under the control of the camera itself, e.g. date and
time (from an internal digital clock), f-stop, shutter
speed, frame number, and other camera operations. The data
processing circuit 24 and sound processor IC 25 convert the
incoming analog signal to coded digital data which is then
recorded in the digital memory 26. Audio may be recorded
into memory 26 in this manner before, during or after the
picture-taking event as described further below.

Once having recorded the audio message in memory 26, it
is then possible to review the message for content via
speaker 20 in the camera. To do this, the user selects the
"review" mode by means of a user input 22 which causes the
microcontroller 23 to set the memory 26 to the "read" mode
thereby enabling the sound processor 25 and the analog
amplifier and data processing circuit 24 to play back audio
through speaker 20. If the recorded message is not
satisfactory, the user can easily change it by simply
repeating the recording process described above.

During this composing and editing process, in accordance
with one aspect of the present invention, each spoken word
of the message is compared in the voice recognition unit 40
to the word patterns stored in fixed vocabulary memory
units 42/42' or 44/44' and the adjustable vocabulary memory
unit 46/46', as illustrated in the voice recognition flow
chart of Figure 7. At the start, the user speaks a single
word in step S10. The voice recognition unit 40 makes the
comparison of the digitized, voiced word to the stored
vocabulary of digitized words and locates the closest match
in step S12. The word may be recognized in step S14 in
several ways. In one way, the closest match word may be
displayed on the LCD panel 30 in a display mode, or the
matched word may be audibly played back by the speaker 20
using speech synthesis techniques well known in the art. If
the displayed or played back word is not correct, the user
may indicate non-recognition of the word through a user
input 22 or by voicing a simple, unambiguous negation
command that may be recognized only in the display mode in
step S16. A further match may then be attempted by the
voice recognition unit 40 or the user may interrupt the
recognition process to again voice the word or a different
word in step S10 and repeat the process. The user indicates
acceptance of the word displayed through a user input 22 or
by voicing a simple, unambiguous affirmation command that
may be recognized only in the display mode in step S14. The
microcontroller 23 responds by storing the digitized word
from the memory unit 42, 44 or 46 into the digital memory
26 and prompting the user to speak another word in step
S18. The display and acceptance of the words of the message
may be conducted word by word until the entire message is
recognized and stored in memory 26. As described hereafter,
the accepted message is recorded in the MOF layer 13 or the
memory module 38 before the film cartridge is removed from
the camera and provided to the photofinisher.
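
The word-by-word loop of Figure 7 can be sketched in software as follows. This is a rough illustration under stated assumptions, not the recognition unit 40 itself: plain string similarity from Python's `difflib` stands in for comparison of digitized voice patterns, and the function names `ranked_matches` and `compose_message` and the `accept` callback (standing in for the user input 22 or a voiced affirmation or negation) are hypothetical.

```python
import difflib


def ranked_matches(heard, vocabulary, limit=3):
    """Step S12 stand-in: rank vocabulary words by similarity to the heard word."""
    # cutoff=0.0 always yields up to `limit` candidates, best match first.
    return difflib.get_close_matches(heard, vocabulary, n=limit, cutoff=0.0)


def compose_message(heard_words, vocabulary, accept):
    """Build a message from accepted matches, one spoken word at a time."""
    message = []  # plays the role of digital memory 26
    for heard in heard_words:  # S10: the user speaks a single word
        for candidate in ranked_matches(heard, vocabulary):
            # S14 / S16: the candidate is shown and the user accepts or rejects it.
            if accept(heard, candidate):
                message.append(candidate)  # store the vocabulary word
                break
        # If every candidate is rejected, the word is dropped; in the camera
        # the user would voice the word again instead.
    return message  # S18: all words processed


vocabulary = ["birthday", "beach", "Disneyland", "Grandma"]
accepted = compose_message(["birthdy", "beech"], vocabulary, lambda h, c: True)
rejected = compose_message(["beech"], vocabulary, lambda h, c: False)
```

Storing the vocabulary word rather than the raw input mirrors the text above, where the digitized word from the vocabulary memory unit is what is written into memory 26.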

Figure 8 illustrates the transfer-to-print process of
translating the voice recognized and stored or recorded
message into an annotation on the border of a print made
from the image frame. At step S20, the recorded message or
annotation is read out from either a memory module or the
MOF layer of the film strip and temporarily stored. The
annotation is transferred to the photosensitive print
border by an alpha-numeric character print head positioned
in the photographic printer to make the exposure of the
annotation during the passage of the print paper through
the photographic printer in step S22.



The photofinishing system employed to transfer the
message to the print is similar to that depicted in Figure
4 of the above incorporated '472 patent. In that Figure 4,
the messages recorded in the MOF layer tracks for each
image frame are read out and converted to a bar code or
blister code format that is imprinted on the print. In the
present case, the recorded messages are read out from
either the MOF layer tracks or the memory module provided
to the photofinisher with the film cartridge, converted (if
necessary) to an alpha-numeric character font, and directly
printed on the print in any convenient place using any
conventional character printer.

The read out of the messages recorded in the MOF layer
tracks may be accomplished by the head and playback
circuits depicted in Figure 4 of the '472 patent as the
film strip is advanced, and the digitized messages are
converted by the data converter into a format usable with
the bar code or blister code printer. Rather than the bar
code or blister code printer, a conventional alpha-numeric
character printer head would be employed to print the
annotation on the print paper in accordance with the
present invention.

In order to accommodate the alternative memory module 38,
the processor of Figure 4 of the '472 patent may be
provided with a slot for reading out the data stored in the
memory module and temporary memory for storing the read out
data and applying it to the data converter. In the case
where the camera is provided to the photofinisher with the
film strip and cartridge in it, the stored messages may
alternatively be read out through a docking station 50
coupled to the processor block of the printer and making
connection with the external interface 32 of the camera. In
any case, the data including the messages and the image
frame identification would be read out, stored, converted
to the printer character font format, and printed on the
print made from the identified image frame.

While the recording of the voice recognized messages on
certain tracks of the MOF layer or in the memory module has
been emphasized in certain of the above-described
embodiments, it will be recognized that the analog voice
message may also be separately stored and recorded in
adjacent tracks or memory for a variety of reasons. For
example, the notes recorded at the time of taking a photo,
and later used to compose the edited message to appear as
the print annotation, may be retained for cross-checking.
The aural notes or composed message may be retained for
playback as a "sound bite" by other means in association
with the prints or otherwise. At step S24 the sound bites,
if recorded, may be transferred to a recording medium, e.g.
a tape cassette to be provided to the user with the
annotated prints made from the negatives and the recorded
messages.

Figure 9 illustrates the combination of the steps of the
voice recognition method of Figure 7 and the
transfer-to-print method of Figure 8 in one fashion for
recording messages with the exposure of each image frame of
a film strip. In step S30 the camera is used to make an
exposure of an image frame. The camera alerts the user, or
the user simply proceeds to then invoke the method of
Figure 7 in step S32. When all words of the message are
recognized in step S18, the message may be stored as
described as follows in a memory module 38 with the image
frame number or by recording the digitized words on the MOF
layer in step S34.

Following the taking of a picture and before the film
strip is advanced to the next image frame by the film
advance motor 29, the camera system microcontroller 23
checks the status of the memory 26. If there is audio
stored in the memory, it sets the memory to the "read" mode
to pass the data from the memory 26 to the film read/write
interface circuits 27 or the annotation module read/write
circuits 37. In the system employing the MOF layer
recording technique described in reference to Figures 1-4,
the microcontroller 23 activates motor controller 28 to
cause motor 29 to initiate film advance to the next frame.
The data transferred from the memory 26 to the recording
interface circuits 27 is recorded on certain tracks of the
MOF layer 13 during the film strip advance. Once the data
is recorded on the MOF layer, microcontroller 23 sets the
status of memory 26 to "empty", thus preparing the memory
26 for the next recording event.
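
The hand-off from memory 26 to the MOF layer during film advance can be modeled with a very small state holder. The sketch below is illustrative only: the class name `CameraModel`, its attribute names, and the use of `None` to represent the "empty" status are assumptions made for the example, and the magnetic recording itself is reduced to a dictionary write keyed by frame number.

```python
class CameraModel:
    """Toy model of the memory-status check performed before each film advance."""

    def __init__(self):
        self.memory = None  # digital memory 26; None models the "empty" status
        self.frame = 1      # image frame currently in the exposure gate
        self.mof = {}       # frame number -> data recorded on the MOF layer

    def record(self, data):
        # A message composed for the current frame is held in memory.
        self.memory = data

    def advance_film(self):
        # If anything is stored, pass it out during the advance, then mark
        # the memory empty so it is ready for the next recording event.
        if self.memory is not None:
            self.mof[self.frame] = self.memory
            self.memory = None
        self.frame += 1


cam = CameraModel()
cam.record("Grandma birthday")
cam.advance_film()  # frame 1 gets the message
cam.advance_film()  # frame 2 had nothing stored
cam.record("beach")
cam.advance_film()  # frame 3 gets the message
```

In this model a frame with no stored audio simply advances with nothing written, which matches the status check described above.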

In the embodiments of Figures 5 and 6, it is not
necessary to synchronize the storage of the image frame
messages in the memory module 38 with movement of the image
frames. Instead, the messages may be transferred, along
with the frame identification, into the separate memory
module 38 when the user indicates that each message is
complete. Or the messages may be retained in memory 26 for
later playback, revision and storage at any time before the
memory module 38 is removed from the camera. In a further
variation described below, recording of the edited messages
on the MOF layers of the filmstrip image frames may also be
done at a later time by storing the user's notes in
relation to each image frame in memory 26.

In step S36, the decision is made to continue taking
pictures on the film strip in the camera. When the film
strip is fully exposed or it is decided to stop taking
pictures, the user provides the film strip and associated
memory module (if any) or the camera to the photofinisher
to invoke the transfer-to-print method of Figure 8. Thus,
in the simplest use of the camera system embodiments and
process described above, the annotations on the resulting
prints contain messages that were recorded in association
with each image frame, wherein each word of the voiced
message is subjected to voice recognition in the camera.

Figure 10 illustrates a method involving the voice
recognition and transfer to print steps of Figures 7 and 8
for selectively repeating the same message to appear as the
same annotation on a series of prints made from a series of
image frames. For example, given the time it may take to
prepare and voice in messages to be recorded, it may be
desirable to compose an annotation to appear on a number of
prints that are to be taken (or may have been taken
already) related to the same event or attraction being
photographed.

The process of Figure 10 is invoked each time that an
exposure is made. At step S40 the user takes a picture, and
at step S41 the stored algorithm indicates if an earlier
stored message is to be used again depending on commands
entered previously at step S43. If "no", then the voice
recognition process of Figure 7 is invoked in step S42. At
step S43, the user may input the number of succeeding image
frames to store the message with by activating a user input
22 or voicing a unique command that is recognized by the
voice recognition unit 40. At step S44, the message is
stored or recorded in a manner previously described. The
decision is made at step S45 to continue with the exposures
of the film strip as described above, and, if so, then the
process starts over with the next exposure at step S40. If
not, then the transfer-to-print process of Figure 8 is
invoked at step S46 as described above.

If such an entry to apply the same annotation to the next
image frame has already been made in step S43, then the
"yes" response at step S41 is satisfied. The same message
is then stored with respect to that image frame at step S44
and the process continues as described above. It will be
recognized that the order of the steps of Figure 10 may be
altered to accomplish the same result, by having the user
indicate at step S41 that a previously stored message is to
be used again in association with the most recently exposed
(or to be exposed) image frame. Moreover, it would be
possible to store not only a previously voice recognized
message but also a further message subjected to the voice
recognition process for that image frame.
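
The repeat logic of Figure 10 amounts to a small counter carried from one exposure to the next. The following sketch assumes a simplified input format invented for the example: each exposure is a pair of the newly composed message (ignored while a repeat is pending) and the number of succeeding frames that should reuse it. The function name `annotate_frames` is likewise hypothetical.

```python
def annotate_frames(exposures):
    """Return {frame number: message} following the S40-S44 loop of Figure 10."""
    stored = {}
    current = None  # most recently composed message
    remaining = 0   # succeeding frames still to receive `current`
    for frame, (message, repeat) in enumerate(exposures, start=1):  # S40
        if remaining > 0:
            # S41 "yes": an earlier entry at S43 covers this frame.
            remaining -= 1
        else:
            # S41 "no": compose a new message (S42) and note the repeat count (S43).
            current = message
            remaining = repeat
        stored[frame] = current  # S44: store the message with this frame
    return stored


annotations = annotate_frames([
    ("Theme park", 2),  # applies to this frame and the next two
    (None, 0),
    (None, 0),
    ("Beach", 0),
])
```

Here the first message is composed once and stored with three frames, so the user voices only two messages for four exposures.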

Turning now to Figure 11, it depicts a further method for
initially temporarily storing voiced-in messages or notes
for each image frame and then, at a later time, reproducing
the messages, composing the annotation to appear on the
print, and recording or storing the composed message using
the voice recognition process of Figure 7 and the
transfer-to-print process of Figure 8. Storing
voiced-in messages in the camera requires much less time
during picture taking than if voice recognition were
accomplished at the time of picture taking. The method of
Figure 11 is most readily implemented in the embodiments of
Figures 5 and 6 and is first described for those
embodiments as follows. At step S50, the picture is taken
by the user, and a descriptive note or phrase is voiced in
for storage without voice recognition in memory 26. When
all image frames are exposed, or it is otherwise indicated
by the user that exposures are no longer to be made, or
that the user now wishes to compose and store or record
messages for the image frames exposed previously, at step
S52, the temporarily stored voiced notes are played back at
step S53. The storage and playback may be phonetic, since
the user is likely to be able to recognize the words spoken
even if the words played back are somewhat flawed. As each
image frame note is played back, the user may thereby be
reminded as to the content of that image frame and proceed
at step S54 to invoke the voice recognition process of
Figure 7 and compose the message to appear as the print
annotation. At step S55, the composed and recognized
message is again stored in a memory module 38 as described
above.

At step S56 the decision is made to continue with the
exposures of any remaining image frames of the film strip
as described above, and, if so, then the process starts
over with the next exposure at step S50. If not, then the
transfer-to-print process of Figure 8 is invoked at step
S57 as described above. Thus, the use of the camera may be
interrupted at any time to play back the stored notes, edit
and compose the message, and store or record it, at a time
later than the time of exposure and whether or not all
image frames of the film strip are exposed.
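
The deferred workflow of Figure 11 separates quick note taking from later composition. The sketch below is a simplified illustration: the function name `compose_later` and the `compose` callback (standing in for playback followed by the voice recognition process of Figure 7) are assumptions, and the notes are represented as text rather than stored audio.

```python
def compose_later(notes, compose):
    """Turn temporarily stored per-frame notes into composed messages.

    notes:   {frame number: raw note stored at exposure time}
    compose: callable standing in for playback (S53) and composition (S54)
    """
    module = {}  # plays the role of memory module 38
    for frame in sorted(notes):  # review the frames in exposure order
        module[frame] = compose(notes[frame])  # S55: store the composed message
    return module


# Notes voiced in quickly while shooting, without recognition.
raw_notes = {2: "beech day", 1: "grandma bday"}
composed = compose_later(raw_notes, str.upper)  # str.upper is a placeholder composer
```

Because the notes are keyed by frame number, composition can be interrupted and resumed for any subset of frames, which matches the flexibility described above.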

In respect to the embodiments of Figures 3 and 4, the
above method may be followed by temporarily storing the
notes in memory 26 in step S51 as each image frame is
exposed. Then, preceding step S53, it would be necessary to
rewind the film strip back to the first image frame that
does not have a message previously stored in the MOF layer
13 in accordance with step S55. Then steps S53 and S54
would be followed as described above to prepare each
message for recording in the MOF layer 13 of the associated
image frame. Recording would then take place as the film is
advanced to the next image frame.

In a further variation, the film advance motor 29 of 
the camera system may be reversible to allow the film 
strip to be advanced in a first direction to effect the temporary
recording of the voiced-in notes in MOF layer
tracks as each image frame is advanced before or after 
the exposure at step S51. Then, at step S53, the film 
advance may be reversed to rewind the film strip back to 
the first image frame having a message to be edited. The 
film advance motor 29 is operated to advance the image 
frame and play back the recorded notes for that image 
frame for the user to listen to. After composition and voice 
recognition of the message, the film advance motor 29 
may be again reversed to rewind the film strip to the start- 
ing position and to then advance the film strip again while 
recording the message. This process may be repeated 
for each image frame until all messages are recorded. 

Although the present invention has been fully 
described with reference to the preferred embodiments
thereof, many modifications and variations thereof will be 
apparent to those skilled in the art without departing from 
the spirit and scope thereof. 

The invention is summarized as follows: 

1. In a photographic camera including an optical lens, a
photographic filmstrip transport mechanism for advancing
the filmstrip in a path of travel to and through an image
frame exposure gate with respect to said optical lens, and
an exposure system for making an exposure of the filmstrip
image frame in the exposure gate, apparatus for recording a
voice message composed by the camera user related to the
exposure made or to be made for playback in conjunction
with making prints from the photographic images captured in
the image frames of the filmstrip to provide for the
printing of the voice message therewith comprising:

speech input means into which a camera user may speak
words of the message to be stored with respect to the
filmstrip image frames;

sound processing means for processing the words spoken
into the speech input means as voice digital data;

means for providing reference voice digital 
data corresponding to a reference word vocabulary; 

speech recognition means for comparing the processed
voice digital data to the reference voice digital data and
recognizing processed voice digital data corresponding to
the reference voice digital data while rejecting voice
digital data not finding correspondence with the reference
voice digital data;

message memory means having memory 
locations related to each image frame of the filmstrip 
for storing recognized voice digital data; and 

means for storing the recognized voice digital 
data in said message memory means. 

2. The recording apparatus of item 1 wherein: 

said memory means comprises a virtually transparent
magnetic layer on said filmstrip having a plurality of
longitudinally extending parallel tracks therein; and

said storing means further comprises a recording head
arranged in said path of travel of said filmstrip and means
for effecting magnetic recording of said recognized voice
digital data in selected tracks for playback in conjunction
with making prints from the photographic images captured in
the image frames of the filmstrip and a film write
interface circuit responsive to the for energizing the
recording head.

3. The recording apparatus of item 1 wherein: 

said filmstrip is attached at one end to a filmstrip
cartridge and is adapted to be enclosed within said
cartridge upon completion of exposure of all image frames
for removal from said camera for transfer to a
photofinisher to make prints therefrom; and

said memory means comprises a memory module associated
with the filmstrip cartridge for storing said recognized
voice digital data.

4. The recording apparatus of item 3 wherein said
memory module is detachable from said cartridge.

5. The recording apparatus of item 1 wherein said 
means for providing reference voice digital data cor- 
responding to a word vocabulary further comprises: 

first vocabulary memory means for storing a fixed
vocabulary of words that cannot be altered by the user; and

second vocabulary memory means for storing an adjustable
vocabulary of words selected by the user.

6. The recording apparatus of item 5 wherein:

said first vocabulary memory means comprises a read only
memory stored with said fixed vocabulary from which said
fixed vocabulary may be read by said speech recognition
means; and

said second vocabulary memory means comprises a read and
write memory into which said adjustable vocabulary may be
written in and from which said adjustable vocabulary may be
read by said speech recognition means;

and further comprising interface means for receiving said
adjustable vocabulary from an external source and for
writing said adjustable vocabulary into said second
vocabulary memory means.

7. The recording apparatus of item 6 particularly adapted
to receive said adjustable vocabulary from an external
source associated with an event or attraction of
photographic interest through said interface means and
wherein said source may further comprise a camera docking
station for receiving said camera and coupling said
interface means to an input means.

8. The recording apparatus of item 7 wherein: 

said input means comprises a user operated keyboard for
keying in the alphanumeric characters of words of
particular interest to the user; and

said interface means is operable to convert and store the
alphanumeric characters of keyed in words as said
adjustable vocabulary.

9. The recording apparatus of item 7 wherein:

said input means comprises a vending machine operable on
user selection to down load a data set of words of
particular interest to an event or an attraction of
interest to be photographed and selected by the user; and

said interface means is operable to convert and store the
down loaded data set of words as said adjustable vocabulary
in said second vocabulary memory means.

10. The recording apparatus of item 5 wherein:

said first vocabulary memory means comprises a read only
memory in a detachable, interchangeable memory card stored
with said fixed vocabulary to be compared with a spoken
word by said speech recognition means; and
said apparatus further comprises:
card receiving means in said camera for receiving said
read only memory card for making connection with said
speech recognition means.

11. The recording apparatus of item 5 wherein:

said second vocabulary memory means comprises a read and
write memory in a detachable, interchangeable memory card
stored with said adjustable vocabulary to be compared with
a spoken word by said speech recognition means; and
said apparatus further comprises:
card receiving means in said camera for receiving said
read and write memory card for making connection with said
speech recognition means.



12. The recording apparatus of item 5 wherein: 

said first vocabulary memory means com- 
prises a read only memory stored with a portion of 
said fixed vocabulary with speaker independent
training and a further read only memory stored with 
a further portion of said fixed vocabulary and a read 
and write memory for speaker dependent training of 
said further portion of said fixed vocabulary to be 
compared with a spoken word by said speech rec- 
ognition means. 

13. The recording apparatus of item 1 further com- 
prising: 

means for indicating the un-recognized word of processed
voice digital data to the speaker and for prompting the
speaker to repeat the corresponding un-recognized word.

14. The recording apparatus of item 13 further com- 
prising: 

means for playing back and audibly reproduc- 
ing the recognized and stored voice digital data for 
each image frame; and 

means operable by the speaker for editing the 
stored voice digital data by speaking the words of 
the message desired to replace the processed and 
stored voice digital data into said speech input 
means, whereby the spoken words are again sub- 
jected to processing and speech recognition. 

15. The recording apparatus of item 14 further com- 
prising: 

means operable by the speaker to indicate 
the acceptance of the edited words of the message; 
and wherein:

said storing means is responsive to the 
acceptance indication for storing the voice digital 
data corresponding thereto in said memory means. 

16. The recording apparatus of item 13 further comprising:

means for displaying the words spoken one 
at a time in a visible display; and 

means operable by the speaker for editing the 
stored voice digital data by speaking the words of 
the message desired to replace the processed and 
stored voice digital data into said speech input 
means, whereby the spoken words are again sub- 
jected to processing and speech recognition. 

17. The recording apparatus of item 16 further com- 
prising: 

means operable by the speaker to indicate 
the acceptance of the edited words of the message; 
and wherein: 

said storing means is responsive to the 
acceptance indication for storing the voice digital 
data corresponding thereto in said memory means. 

18. The recording apparatus of item 1 further com- 
prising: 

means for playing back and audibly reproduc- 
ing the recognized and stored voice digital data; and 

means operable by the speaker to erase the stored voice
digital data and repeat the speech input in an editing of
the message to be printed in relation to the image frame.

19. The recording apparatus of item 18 further comprising:

means operable by the speaker to indicate the acceptance
of the edited words of the message; and wherein:

said storing means is responsive to the acceptance
indication for storing the voice digital data corresponding
thereto in said memory means.

20. The recording apparatus of item 18 further comprising:

means operable by the speaker for editing the stored
voice digital data by speaking the words of the message
desired to replace the processed and stored voice digital
data into said speech input means, whereby the spoken words
are again subjected to processing and speech recognition.

21. In a photographic camera including an optical lens, a
photographic filmstrip transport mechanism for advancing
the filmstrip in a path of travel to and through an image
frame exposure gate with respect to said optical lens, and
an exposure system for making an exposure of the filmstrip
image frame in the exposure gate, a method of recording a
voice message related to the exposure made for reproduction
in conjunction with making prints from the photographic
images captured in the image frames of the filmstrip,
including the printing of the associated message,
comprising the steps of:

processing spoken words of a message to be
stored with respect to each exposure of an image
frame into a camera speech input means at the time
of making the image frame exposure as voice digital
data;

providing reference voice digital data corresponding
to a word vocabulary;

in a speech recognition operation, comparing
the processed voice digital data to reference voice
digital data and recognizing processed voice digital
data corresponding to the reference voice digital
data while rejecting voice digital data not finding
correspondence with the reference voice digital data;
and

storing the recognized voice digital data into
memory locations related to each image frame of the
filmstrip of a memory means detachable from the
camera to accompany the filmstrip in the printing of
the image frames.
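
As an illustration only, the compare, reject and store steps of item 21 can be sketched in Python as follows. The sketch assumes that spoken words have already been reduced to text tokens, whereas the method itself compares processed voice digital data with reference voice digital data. The names `recognize` and `MessageMemory` and the sample words are hypothetical and do not appear in the specification.

```python
def recognize(spoken_words, vocabulary):
    """Split spoken words into those found in the reference vocabulary
    (recognized) and those finding no correspondence (rejected)."""
    accepted, rejected = [], []
    for word in spoken_words:
        key = word.lower()
        if key in vocabulary:
            accepted.append(key)
        else:
            rejected.append(key)
    return accepted, rejected


class MessageMemory:
    """Hypothetical stand-in for the detachable message memory:
    one memory location per image frame."""

    def __init__(self):
        self._frames = {}

    def store(self, frame_number, words):
        # Append recognized words to the location for this frame.
        self._frames.setdefault(frame_number, []).extend(words)

    def message_for(self, frame_number):
        # Return the stored message, or an empty string if none was stored.
        return " ".join(self._frames.get(frame_number, []))


# Assumed sample vocabulary and utterance, for illustration only.
vocabulary = {"birthday", "party", "grandma"}
memory = MessageMemory()
accepted, rejected = recognize(["Grandma", "birthday", "xylophone"], vocabulary)
memory.store(frame_number=1, words=accepted)
```

Only the recognized words reach the frame's memory location; a rejected word would, per items 18 to 20, be re-spoken or edited by the user.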

22. The method of item 21 wherein said step of providing
a reference word vocabulary further comprises
the steps of:

providing a fixed vocabulary of words associated
with data related to the photographic exposure
of filmstrip image frames in a fixed vocabulary memory;
and

providing an adjustable vocabulary of words
selected by the user in an adjustable vocabulary
memory.



23. The method of item 22 wherein said step of pro- 
viding an adjustable vocabulary comprises: 

providing a vocabulary source in the memory 
of a docking station for receiving the camera and 
making a connection with said camera adjustable 
vocabulary memory; 

inserting the camera into the docking station 
to make the connection between the docking station 
memory and said camera adjustable vocabulary 
memory; and 

downloading the adjustable vocabulary from
said docking station memory into said camera
adjustable vocabulary memory.

24. The method of item 22 wherein said step of providing
an adjustable vocabulary comprises:

providing sets of adjustable vocabulary 
sources in interchangeable memory cards; 

selecting a memory card related to an event 
or attraction of photographic interest; and 

inserting the interchangeable memory card in
a card receiving slot of said camera to thereby provide
said camera adjustable vocabulary memory.
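
The split between a fixed and an adjustable vocabulary in items 22 to 24 can be pictured with the following Python sketch. It is a simplified model under stated assumptions: vocabularies are plain word sets rather than reference voice digital data, loading a new source replaces the previous adjustable vocabulary, and the class name `CameraVocabulary`, the card names and all sample words are hypothetical.

```python
class CameraVocabulary:
    """Fixed vocabulary plus a user-selectable adjustable vocabulary."""

    def __init__(self, fixed_words):
        # The fixed vocabulary cannot be altered after construction.
        self._fixed = frozenset(w.lower() for w in fixed_words)
        self._adjustable = set()

    def load_adjustable(self, words):
        # Assumption: a newly loaded source (docking station download or
        # inserted memory card) replaces the earlier adjustable vocabulary.
        self._adjustable = {w.lower() for w in words}

    def reference_words(self):
        # Words available to the speech recognition operation.
        return self._fixed | self._adjustable


# Hypothetical memory cards, each tied to an event or attraction.
cards = {
    "theme_park": ["castle", "parade"],
    "zoo": ["giraffe", "penguin"],
}

vocab = CameraVocabulary(["flash", "portrait", "date"])
vocab.load_adjustable(cards["zoo"])
zoo_words = vocab.reference_words()

vocab.load_adjustable(cards["theme_park"])
park_words = vocab.reference_words()
```

Swapping the card changes only the adjustable portion; the exposure-related fixed words remain available throughout.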

25. In a photographic camera including an optical 
lens, a photographic filmstrip transport mechanism 
for advancing the filmstrip in a path of travel to and 
through an image frame exposure gate with respect 
to said optical lens, and an exposure system for 
making an exposure of the filmstrip image frame in 
the exposure gate, a method of recording a voice 
message related to the exposure made for reproduc- 
tion in conjunction with making prints from the pho- 
tographic images captured in the image frames of 
the filmstrip, including the printing of the associated 
message, comprising the steps of: 

processing spoken words of a common mes- 
sage to be stored with respect to the exposures of 
more than one image frame into a camera speech 
input means, at a time prior to making or in associ- 
ation with making at least one of the image frame 
exposures, as voice digital data; 

providing reference voice digital data corresponding
to a word vocabulary;

in a speech recognition operation, comparing 
the processed voice digital data to reference voice 
digital data and recognizing processed voice digital 
data corresponding to the reference voice digital 
data while rejecting voice digital data not finding cor- 
respondence with the reference voice digital data; 
and 

at the time of exposure of image frames,
selectively storing the recognized voice digital data
corresponding to the common message into memory
locations related to each image frame of the filmstrip
in a memory means detachable from the
camera to accompany the filmstrip in the printing of
the image frames.

26. The method of item 25 further comprising the 
step of: 

selecting the number of image frames that 
the common message is to be stored with at the time
that the voiced words of the common message are 
processed. 
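
A minimal Python sketch of items 25 and 26 follows: a common message is recognized once, the user selects how many image frames it applies to, and it is stored with each frame at the time of exposure until that count is used up. The names `CommonMessage` and `on_exposure`, the dictionary standing in for the message memory, and the sample words are all assumptions made for illustration.

```python
class CommonMessage:
    """A recognized message to be stored with a selected number of frames."""

    def __init__(self, words, frame_count):
        if frame_count < 1:
            raise ValueError("frame_count must be at least 1")
        self.words = list(words)
        self.remaining = frame_count  # frames still to receive the message


def on_exposure(frame_number, common, memory):
    """Called at each exposure: store the common message with this frame
    while the selected number of frames has not been exhausted."""
    if common is not None and common.remaining > 0:
        memory[frame_number] = " ".join(common.words)
        common.remaining -= 1


# Hypothetical usage: message spoken before the exposures, applied to 3 frames.
memory = {}
common = CommonMessage(["summer", "vacation"], frame_count=3)
for frame in range(1, 6):
    on_exposure(frame, common, memory)
```

Frames 4 and 5 receive no message in this run because the selected count of three frames has already been reached.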

27. In a photographic camera including an optical
lens, a photographic filmstrip transport mechanism
for advancing the filmstrip in a path of travel to and
through an image frame exposure gate with respect
to said optical lens, and an exposure system for
making an exposure of the filmstrip image frame in
the exposure gate, a method of recording a voice
message related to the exposure made for reproduction
in conjunction with making prints from the
photographic images captured in the image frames of
the filmstrip, including the printing of the associated
message, comprising the steps of:

at the time that the exposures of image
frames are made, storing spoken notes within the
camera for playback by the user;

at any time selected by the user, playing back
the stored spoken notes to enable composition of a
voice message to be printed with prints made from
the exposed image frames;

processing the spoken words of a message
to be stored with respect to each image frame into
a camera speech input means as voice digital data;

providing reference voice digital data corresponding
to a word vocabulary;

in a speech recognition operation, comparing
the processed voice digital data to reference voice
digital data and recognizing processed voice digital
data corresponding to the reference voice digital
data while rejecting voice digital data not finding
correspondence with the reference voice digital data;
and

storing the recognized voice digital data into
memory locations related to each image frame of the
filmstrip of a memory means detachable from the
camera to accompany the filmstrip in the printing of
the image frames.

28. The recording apparatus of item 5 wherein:

said first vocabulary memory means comprises
a read only memory in a detachable, interchangeable
memory card stored with said fixed
vocabulary with speaker independent training to be
compared with a spoken word by said speech recognition
means and a further read only memory
stored with a further portion of said fixed vocabulary
and a read and write memory for speaker dependent
training of said further portion of said fixed vocabulary
to be compared with a spoken word by said
speech recognition means; and

said apparatus further comprises:
card receiving means in said camera for
receiving said read only memory card for making
connection with said speech recognition means.



Claims 

1. In a photographic camera including an optical lens,
a photographic filmstrip transport mechanism for
advancing the filmstrip in a path of travel to and
through an image frame exposure gate with respect
to said optical lens, and an exposure system for
making an exposure of the filmstrip image frame in
the exposure gate, apparatus for recording a voice
message composed by the camera user related to
the exposure made or to be made for playback in
conjunction with making prints from the photographic
images captured in the image frames of the
filmstrip to provide for the printing of the voice message
therewith comprising:

speech input means into which a camera
user may speak words of the message to be stored
with respect to the filmstrip image frames;

sound processing means for processing the 
words spoken into the speech input means as voice 
digital data; 

means for providing reference voice digital 
data corresponding to a reference word vocabulary; 

speech recognition means for comparing the
processed voice digital data to the reference voice
digital data and recognizing processed voice digital
data corresponding to the reference voice digital
data while rejecting voice digital data not finding
correspondence with the reference voice digital data;

message memory means having memory
locations related to each image frame of the filmstrip
for storing recognized voice digital data; and

means for storing the recognized voice digital 
data in said message memory means. 

2. The recording apparatus of Claim 1 wherein:

said memory means comprises a virtually
transparent magnetic layer on said filmstrip having
a plurality of longitudinally extending parallel tracks
therein; and

said storing means further comprises a
recording head arranged in said path of travel of said
filmstrip and means for effecting magnetic recording
of said recognized voice digital data in selected
tracks for playback in conjunction with making prints
from the photographic images captured in the image
frames of the filmstrip and a film write interface circuit
responsive to the for energizing the recording
head.

3. The recording apparatus of Claim 1 wherein:

said filmstrip is attached at one end to a filmstrip
cartridge and is adapted to be enclosed within
said cartridge upon completion of exposure of all
image frames for removal from said camera for
transfer to a photofinisher to make prints therefrom;
and

said memory means comprises a memory
module associated with the filmstrip cartridge for
storing said recognized voice digital data.

4. The recording apparatus of Claim 3 wherein said 
memory module is detachable from said cartridge. 

5. The recording apparatus of Claim 1 wherein said 
means for providing reference voice digital data cor- 
responding to a word vocabulary further comprises: 

first vocabulary memory means for storing a 
fixed vocabulary of words that cannot be altered by 
the user; and 

second vocabulary memory means for storing
an adjustable vocabulary of words selected by the
user.

6. The recording apparatus of Claim 5 wherein: 

said first vocabulary memory means com- 
prises a read only memory stored with said fixed 
vocabulary from which said fixed vocabulary may be 
read by said speech recognition means; and 

said second vocabulary memory means
comprises a read and write memory into which said
adjustable vocabulary may be written and from
which said adjustable vocabulary may be read by
said speech recognition means;

and further comprising interface means for 
receiving said adjustable vocabulary from an exter- 
nal source and for writing said adjustable vocabulary 
into said second vocabulary memory means. 

7. In a photographic camera including an optical lens, 
a photographic filmstrip transport mechanism for 
advancing the filmstrip in a path of travel to and 
through an image frame exposure gate with respect 
to said optical lens, and an exposure system for 
making an exposure of the filmstrip image frame in 
the exposure gate, a method of recording a voice 
message related to the exposure made for reproduc- 
tion in conjunction with making prints from the pho- 
tographic images captured in the image frames of 
the filmstrip, including the printing of the associated 
message, comprising the steps of: 

processing spoken words of a message to be
stored with respect to each exposure of an image
frame into a camera speech input means at the time
of making the image frame exposure as voice digital
data;

providing reference voice digital data corre- 
sponding to a word vocabulary; 

in a speech recognition operation, comparing
the processed voice digital data to reference voice
digital data and recognizing processed voice digital
data corresponding to the reference voice digital
data while rejecting voice digital data not finding
correspondence with the reference voice digital data;
and

storing the recognized voice digital data into
memory locations related to each image frame of the
filmstrip of a memory means detachable from the
camera to accompany the filmstrip in the printing of
the image frames.

8. The method of Claim 7 wherein said step of providing
a reference word vocabulary further comprises
the steps of:

providing a fixed vocabulary of words associated
with data related to the photographic exposure
of filmstrip image frames in a fixed vocabulary memory;
and

providing an adjustable vocabulary of words
selected by the user in an adjustable vocabulary
memory.

9. The method of Claim 8 wherein said step of providing
an adjustable vocabulary comprises:

providing a vocabulary source in the memory
of a docking station for receiving the camera and
making a connection with said camera adjustable
vocabulary memory;

inserting the camera into the docking station
to make the connection between the docking station
memory and said camera adjustable vocabulary
memory; and

downloading the adjustable vocabulary from
said docking station memory into said camera
adjustable vocabulary memory.

10. The method of Claim 8 wherein said step of providing
an adjustable vocabulary comprises:

providing sets of adjustable vocabulary
sources in interchangeable memory cards;

selecting a memory card related to an event
or attraction of photographic interest; and

inserting the interchangeable memory card in
a card receiving slot of said camera to thereby provide
said camera adjustable vocabulary memory.